Harnessing NLP Techniques in the Processes of Multilingual Content Management
Abstract
The emergence of the WWW as the main channel for distributing content opened the floodgates of information. The sheer volume and diversity of this content necessitate an approach that will reinvent the way it is analysed. The quantitative route to processing information, which relies on content management tools, provides structural analysis. The challenge we address is to evolve from the process of streamlining data to a level of understanding that assigns value to content. We present ATLAS, an open-source multilingual platform that incorporates human language technologies in the process of multilingual web content management. It complements i-Publisher, a content management software-as-a-service component used for creating, running and managing dynamic content-driven websites, with a linguistic platform. The platform enriches the content of these websites with revealing details and reduces the manual work of classification editors by automatically categorising content. The platform ASSET supports six European languages. We expect ASSET to serve as a basis for future development of deep analysis tools capable of generating abstractive summaries and training models for decision-making systems.
Similar resources
What can NLP techniques do for eLearning?
The aim of the Language Technology for eLearning project is to show that current results achieved in the area of Natural Language Processing and the Semantic Web (i.e. ontologies) can play a relevant role in improving the functionality of existing Learning Management Systems (LMS). In this paper, we discuss how current NLP techniques have been employed for the development of a keywo...
Managing Multimodal and Multilingual Semantic Content
With the advent and increasing popularity of Semantic Wikis and Linked Data, the management of semantically represented knowledge became mainstream. However, certain categories of semantically enriched content, such as multimodal documents and multilingual textual resources, are still difficult to handle. In this paper, we present a comprehensive strategy for managing the life-cycle of...
Developing customer relationship management (CRM) processes using standard frameworks and MCDM techniques
Through experience gained over time, organizations have found that a functional approach to business destroys flexibility and agility. The main weakness of task-based organizations is that they can hardly act flexibly and adapt themselves to a changing environment. In today's dynamic and competitive business world, organizations focus significantly on managing and improving their business pro...
Grammar Sharing Techniques for Rule-based Multilingual NLP Systems
Rule-based multilingual natural language processing (NLP) applications such as machine translation systems require the development of grammars for multiple languages. Grammar writing, however, is often a slow and laborious process. In this paper we describe a methodology for multilingual and multipurpose grammar development based on grammar sharing. This paper presents the first step towards a ...
ATLAS – The Multilingual Language Processing
This paper presents the ATLAS platform, a multilingual language processing framework integrating a common set of linguistic tools for a group of European languages (less-resourced: Bulgarian, Croatian, Greek, Polish and Romanian, together with English and German as reference languages). State-of-the-art NLP functionality offered by the platform allows for multilingual annotation of texts on low...